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Abstract — In this work, the geometric relation between space 
time block code design for the coherent channel and its non- 
coherent counterpart is exploited to get an analogue of the 
information theoretic inequality I(X\ S) < I((X, H)\ S) in terms 
of diversity. It provides a lower bound on the performance of 
non-coherent codes when used in coherent scenarios. This leads 
in turn to a code design decomposition result splitting coherent 
code design into two complexity reduced sub tasks. Moreover 
a geometrical criterion for high performance space time code 
design is derived. 



I. Introduction 

In MIMO (Multiple Input Multiple Output) systems space 
time coding schemes have been proven to be an appropri- 
ate tool to exploit the spatial diversity gains. Two distinct 
scenarios are common, whether the channel coefficients are 
known (coherent scenario) [1], to the receiver or not (non- 
coherent scenario) [2]. Prominent coherent codes are the 
well known Alamouti scheme [3] and general orthogonal 
designs [4]. A more flexible coding scheme are the so-called 
linear dispersion codes. They have been introduced in [5] and 
were further investigated in [6]. A full rate high performing 
example is the recently discovered Golden code [7]. Genuine 
non-coherent codes have been proposed in [8], but most 
of the research efforts in the literature focus on differential 
schemes, introduced in [9], since differential codes usually 
provide higher data rates than comparable non differential 
codes. High performing examples have been constructed in 
[10], [1 1],[12],[13]. However, in both (coherent and non- 
coherent) cases most research effort has been undertaken for 
space time block codes with quadratic 2-by-2, resp. n t -by-n t 
code matrices (n t denotes the number of transmit antennas). 
Although linear dispersion codes are not restricted to quadratic 
shape of the design matrices the block length is not a free 
design parameter when the number of transmit antennas is 
held fixed (compare the asymptotic guidelines in [6]). 

In general [14] the signal matrices are of rectangular shape 
of size Txn t with unitary columns. The corresponding coding 
spaces for the coherent and non-coherent scenario are the 
complex Stiefel and Grassmann manifolds respectively. Typ- 
ically the number n t of transmit antennas is a small number 

Journal reference: IEEE Trans. Inform. Theory 52, no 12 (2006), 5324- 
5335. Personal use of this material is permitted. However, permission to 
reprint/republish this material for advertising or promotional purposes or for 
creating new collective works for resale or redistribution to servers or lists, 
or to reuse any copyrighted component of this work in other works must be 
obtained from the IEEE 



due to hardware limitations, while the block length T can 
be chosen rather large, upper bounded only by the coherence 
length of the channel. Inspired from [15], in [16] a general 
analysis of packings in the Stiefel and Grassmann manifold 
revealed, that the achievable squared minimal distance (i.e. 
the squared diameter of the decision regions for decoding) 
grows proportionally with the block length T, more precisely 
the following proposition holds [16]): 

Proposition 1.1 

For any T > 2nt set T)n t ,T — n t (2T — n t ) (coherent 
channel), resp. D nt T — 2n t (T — n t ) (non- coherent chan- 
nel). Then for any prescribed rate R — y log|C| there exist 
space time block codes C with rate R and minimal distance 
dg satisfying 



d > CJp 



(provided p > 1) (1) 



for some constant C > depending on the channel 
knowledge at the receiver. Since the rightmost term is 
monotonically increasing as a function of T, the receiver 
performance increases proportionally to 

Having the common literature (see above) in mind, this result 
comes rather unexpected and further research effort seems 
promising. However, explicit code constructions have already 
been achieved in [17]. Moreover, the Proposition 1.1 becomes 
even more important when considering space frequency code 
design: The schemes [18], [19] indicate, that the relevant 
coding spaces are certain subsets of (large dimensional) Stiefel 
and Grassmann manifolds. Thus considering these coding 
spaces in general may be of considerable importance for space 
frequency code designs. Explicit space frequency construc- 
tions can be found in e.g. [20]. 

In the present work it will be shown, how general space 
time block code designs can be decomposed into two 'smaller' 
pieces with reduced design complexity (Theorem III. 5), both 
already in the focus of current research. The achieved result 
can be seen as complementary to that of Kammoun and 
Belfiore [21], who presented a coding scheme for non-coherent 
channel space time block codes in terms of coherent channel 
ones, compare Remark III. 7 for further implications. 

The key observation is the quite intuitive but technically 
not obvious diversity monotonicity (Proposition HL4), which 
states that the performance of each non-coherent channel space 
time block code grows when considered as a coherent channel 
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code. This turns out to be due to some higher resolution of the 
coherent channel receiver, reflecting the information theoretic 
relation between the system designs. 

Further insight on the performance is obtained by local anal- 
ysis of diversity, leading to the overall picture of space time 
coding as a constrained sphere packing problem. It reveals 
additional structures not obvious from the traditional point of 
view, proposing high performance design criteria (Conclusion 
IV.2, IV.4) and adding a further estimate (Proposition IV.5) to 
the diversity embedding. By the way all results are obtained in 
the spirit of geometrical methods in space time coding theory. 

The remainder of this work is organized as follows. Section 

II introduces the basic models for the channels and coding 
spaces (with emphasis on their geometrical structures), fixes 
notation and conventions used throughout this work. Section 

III defines diversities for the coherent/non-coherent channel 
cases as our fundamental performance measure and analyses 
their interrelations, culminating in the embedding and decom- 
position results mentioned above. Section IV focuses on the 
local analysis of diversity and the connection to the sphere 
packing problem, exploring its consequences. Finally, the main 
results will be summarized for concise reference together with 
remaining open questions. 

II. Channel model and coding spaces 

In this section the basic channel model will be presented, 
leading to the Stiefel and Grassmann manifolds as coding 
spaces. These spaces will be introduced with emphasis on their 
topological metric structures induced by the maximum like- 
lihood receivers. The geometric relation between the coding 
spaces is precisely expressed by the principal fiber structure, 
which is also introduced here. Although the geometric terms 
used in this work will be defined (as far as it seems necessary 
to understand the concepts), the reader who prefers rigorous 
definitions is invited to consult standard text books e.g. [22], 
[23] (manifolds), [23], [24], [25] (homogeneous spaces, Lie 
groups), and/or [23], [25] (principal fibers). For the particular 
case of the (complex) Stiefel and Grassmann manifolds an 
introduction to their real counterparts aimed at non-specialists 
is [26]. 

A. Channel model 

We consider the Rayleigh flat fading MIMO (multiple 
input multiple output) channel without channel knowledge 
at the transmitter and maximum likelihood decoding at the 
receiver as described in [14] (with normed expected power 



E,E| 



expectation): 



1 per time step, i = 1,...,T, E denotes 



X = ^SH + W , 

S = { Sij ) g C Tx "% H e c IltX ™", x, w e C Txn - 



(2) 



whereas T denotes the coherence time of the channel (respec- 
tively the block length of the signals), n t , n r denote the num- 
ber of transmit, resp. receiver antennas, W ~ i.i.d. CAf(Q, 1) 
is the noise, H <~ i.i.d. CAf(Q, 1) the channel matrix and S, X 
denote the transmitted, resp. received signal with SNR (signal 



to noise ration) p. The (ergodic) channel capacity is defined 
by the supremum of the mutual information 

C = ^ sup I{(X, H);S), resp. C = ± sup I(X- S) (3) 



T 



P(S) 



T 



P(S) 



for the coherent (resp. non-coherent) channel, and we define 
the rate R of the code C by 



R-= ylOg|C| 



(4) 



The normalization by 1 /T is merely a convention to have the 
block length T as a free design parameter of the code, such 
that codes with distinct block length are comparable. 



B. Coding spaces 

Hochwald and Marzetta [14, Theorem 1 and 2] have shown, 
that signals S of the form S = ^=$£ are optimal with 
respect to the channel capacity (due to the central limit 
theorem tending to Co defined below, when T — ► oo), if 
the receiver does not know the channel. More precisely one 
has T > n t , £ — diag(ei) G C™ tXnt with non-negative, 
Ee? = T stochastic independent from $, obeying $t$ — 
(n t x n t -unit matrix), <j> therefore being canonically an element 
of the complex Stiefel manifold V® T defined below. In [2] 
Hochwald/Marzetta, and more generally Zheng/Tse in [27, 
Lemma 8] have shown that the optimal energy allocation £ 
of the antennas equals (asymptotically in p) £ = VTl, thus 



(5) 



— $ then carries 



for n t > n r , T > 2n r . The signal S = 
the total energy \\S\\j? = T, thus the transmitter sends with 
unit power per time step. In this case the mutual information 
(ergodic in the channel realizations) I(X; S) depends only on 
the subspace in C T spanned by the columns of $, not on 
$ itself [27]. This is reflected by the fact, that scalings and 
linear combinations of the columns of $ are indistinguishable 
for the detector, when the channel is non-coherent. Therefore 
these transformations cannot carry any information and we end 

up with signals S G \j~^~ t @n t t> @n t t denoting the complex 
Grassmann manifold of n t -dimensional linear subspaces of 
C T . 

For the coherent channel the capacity has been calculated 
by Telatar [28] to 



(6) 



Assuming the same energy allocation £ = \/Tl one can jus- 
tify, that now the asymptotically optimal signal space consists 
of signals S e ^V^ T . 

We focus on both signal designs in this article, sometimes 
called unitary space time modulation in the literature (intro- 
duced in [2]). 
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C. Coherent channel: The Stiefel manifold V^ t T 
The (complex) Stiefel manifold defined by 

:= {$ e C Txnt |$ f $ = 1} 



v. 



n t ,T 



(7) 



the metric d remains invariant under left or right multiplication 
of its arguments with unitary matrices (also denoted as left 
invariance resp. right invariance): 



is diffeomorphic to a coset space with respect to the unitary 
group U(T) of T-by-T unitary matrices: 



d{U<f>v,UVv) = 



(16) 



VZt = U(T)/(1 



$ = (J) (8) 



"n t ,T = u V J ) I \0 U(T-n t ), 

whereas = means 'diffeomorphic to'. From this equivalence 
we obtain 



This property is one motivation for the geometric picture 
of the Stiefel manifold as a homogeneous space with its 
corresponding left and right actions. Furthermore for each 
singular value cjj holds 



D nt ,T ■= dirriR V nuT 
= n t (2T-n t ) 



dim R U(T) - dim R U(T-n t ) 



1 ^-2 

2 i 



WW 

= Ai(l - *($ttf)) = i - A„ t _ i+1 H$t^)) 
whereas "A'/ := 1/2(M + M*) denotes the hermitian part. 



(17) 



(9) 

for free. Since the elements of the Stiefel manifolds are Tri- 
dimensional orthonormal bases, they are called n t -frames. D - Non-Coherent channel: The Grassmann manifold G® t T 
Geometrically the coset representation of V® t T is interpreted The (complex) Grassmann manifold Gf lt T is the set of all 
as a so-called homogeneous space 

(u(t)^IvZt; U(T- 



n t -dimensional (complex) linear subspaces of C 



T. 



n t ) 



(10) 



(18) 



This means that each $ e V® T is the image of a projection 
TTy from some unitary T-by-T matrix U (in the coset repre- 
sentation 7Ty is simply the projection on the first n t columns 
of U = ($,$-"-)) and for each * e T there exist an 
unitary with <3/ = The latter property is obviously 

fulfilled and called a transitive left action of the group U(T) on 
V^ t t ( me defining property for V® t being a homogeneous 
space), while the former property means that iTy is invariant 
with respect to the right action of U(T — n t ) on V^ t t- 

As a linear algebraic convention used in this work, eigen- 
values Xi and singular values Oi of matrices will be arranged 
in decreasing order, thus Ai > • • • > A„ t , and a\ > ■ ■ ■ > a nt . 

A code C v for the coherent channel model is given by a 
discrete set C v = C T . At the receiver the maximum 
likelihood decision reads (see [2]) 



{<<*>> \**k,t} 

whereas ($) denotes the column space of <&. Since $ i — > 
($) is a projection invariant under all n t -by-n t unitary basis 
transformations we get the coset representation 

'(U{n t ) 



U n t ,T 



U(T) 





U{T-n t ) 



) , (*> 



$$ 1 



(19) 



:= (1,0)$) and 

D nt ,T ■= dim R G^ t T 
= dim R *7(T)- 
= 2n*(T-nt) 



dim R C/ (n t ) - dim R U(T - n t ) 



(20) 



ML 



arg mm 



(11) 



whereas X = y p^- +VK is the received signal. Throwing 
away the noise term allows a formulation of a code design 
criterion in the signal space V® T , induced from the ML 
receiver: The maximization of the pairwise distances d, given 
by 



Note that the coordinate representation ($) = holds 
only locally in general (since it requires $ x to have full rank), 
but it turns out, that this representation covers all but a set of 
measure zero and we abandon this distinction between local 
and global properties in the sequel and drop the distinction 
between G^ t t an d i ts coordinate domain. 

Again we have a geometrical reformulation in terms of the 
homogeneous space 



U(T) 



Lx-. 



U{n t ) x U(T-n t ) 



(21) 



d(*,tf) := 
where we have set 



(12) 



A:=$-* (13) 
a :=(cti,..., a nt ), (7j := <Ti(A) e [0, 2] (14) 

Thus coding corresponds to a packing problem on the metric 
space (V„ c t T ,d) a) . Note, that by 

<Ti(U(Q-V)v) = at (*-*), V r . 



/(7e(7(T),«e(7(n t ) 



(15) 



a) We will see in section III, that this is only an approximation of the design 
criterion, but the importance of the packing gain will become clear in section 
IV 



whereas the transitive left action now reads (\I>) = 
($) := (t/ ww $) (e.g. choose = £/**)• 

The projection ttq is now invariant with respect to the com- 
bined right action of U(n t ) x U(T — n t ), because not only 
the orthogonal complement of the columns in $ has been 
neglected, but also the particular choice of the spanning n t - 
frame: Each (^j^- 1 -) (jj) represents the same space ($) for 
arbitrary u e U(n t ), v G U(T - n t ). 

To simplify matters let us assume n t < T/2 whenever 
we are in contact with the Grassmann manifold. This is no 
restriction, since for n t > T/2 we can always switch to the 
orthogonal complement of the subspaces under consideration. 
Given now two elements ($) , e G% t t tnen there exist 
n t principal angles < i?i < ■ ■ ■ < i?„ t < 7r/2 between 
($) and ($). They are defined successively by the critical 
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values arccos|<Wi, Wi>\, i = 1, . . . , n t (in increasing order), 
of (v, w) i — ► arccos|<v, w>| where the unit vectors v, w vary 
over {vi, . . . , Vi-i} 1 - C (<&), respectively {wi, . . . , u^-i} 1 - C 
('J), compare [29]. The components of the vector of principal 
angles i? := . . . , i?„ t ) can be computed by the formula 
(any representing n t -frame will do) [29] 



COSl?i = (T i ($ t \I/) 



(22) 



An important application of principal angles on some given 
pair (<&) , (*I>) with principal angles $ is, that due to the 
transitivity of the unitary group action there exist an unitary 
U, such that (say) can always be translated into ( J ) = E/vP 
and in U ($) = one can choose a basis such that we 

end up with the canonical representing n t -frames 



^(cos "&i 
$0 = I (sintf, 




(23) 



(where (costfj) := diag(cosi? i ) i =i i ... int G R"' x ™') for the 
translated spaces (* ) = U ($ ) = U(&). Note, that 
the demand to choose the appropriate basis $ in f/ ($) is 
mandatory, in general there is no U <G U (T) which translates 
the n t -frames <£, $ simultaneously into $o- 

E. The principal fiber structure Pq 

The natural relationship between the homogeneous spaces 
V^ t T and G® tT is subsumed in the canonical principal fiber 
bundle structure 



Pi ■= « 



(24) 



which (locally) embeds G® f t mto t choosing a rep- 
resenting n t -frame $ which spans the subspace ($). However 
there remains the freedom of multiplication with arbitrary 
unitary matrices u e U(n t ) from the right (all of them have 
the same image under the projection itq), and for practical 
applications it is necessary to specify a unique choice for $ 
and u, given ($) (simultaneously for all ($) e T , not only 
for pairs as in (23)). But locally this can always be achieved 
and we do not want to go into details here. The term 'principal 
fiber bundle' means a generalization of the term 'homogeneous 
space', where now the total space V® T no longer need to be 
a group and the base space G^ T is a projection ttq of the 
total space which is invariant under a right action of U(n t ). 
The set of all elements $u is called a fiber over ($). 

This geometrical point of view makes clear, that we can 
consider codes C G C G® f t f° r tne non-coherent channel as 
discrete subsets of V® T in virtue of the local embedding of 
G^ T into V® T . But one motivation for the introduction of 
all these perhaps unfamiliar geometrical terms is to clarify 
the relationship between the coding spaces, i.e. that there 
is no canonical representation of C G in V® T . In practical 
applications this peculiarity is often overlooked, since common 
mathematics software packages already use certain conven- 
tions when representing subspaces in terms of singular value 
decompositions. Furthermore we will see that the unitary left 
and right actions on the coding spaces lead naturally to the 
diversity embedding results derived in the next section. These 



results are geometrical in nature rather than linear algebraic, 
but only in the geometric context it becomes clear, that they are 
not obvious at all, since they relate distinct metric structures. 
For the Stiefel manifold the relevant metric structure has 
already been defined in (12) and for the Grassmann manifold 
we will define it next. 

We consider codes C G C G^ t t a l wa Y s as discrete subsets 
of V® t t an d the maximum likelihood criterion for the non- 
coherent channel receiver reads now ([2]) 



arg max 



(25) 



whereas X = y p^ ^H+W is the received signal. To obtain 

a design criterion in the signal space G^ t T we throw away 
the noise term (as in the coherent channel case) and pass from 
to * e G® uT C Vn t ,T (this operation does not change 
the column space of Setting 



A := 



(26) 



|| A|| F 2 = ]T a\ (A) ( i> ]T cos 2 t = n t - £ sin 2 <?< (27) 

i— 1 i i 

(note that (27) does not depend on the choice of the repre- 
senting n t -frame, thus represents really an entity on G® T ), 
and 

a = (cti, . . . ,q nt ) , q_i \— <Tj(A) = cosi9i (28) 

the ML criterion demands the maximization of the pairwise 
distances 



d(*,*) := y/rn ~ ||A|; 



tr 1 



A f A) 



\E(!-^) = 

\ i=i 



sin 2 j?. 



(29) 



Formally d is defined on all of V® t t> but independent of the 
choice of the representing n t -frame as already indicated. Of 
course, it is a metric in the strict sense only as a function on 
Gn t (known as the 'chordal' distance, compare [30], [15]), 
turning again the coding problem into a packing problem in 
(Gn t ,T' 4)- 11 shares the invariance properties of the coherent 
channel metric d, but satisfies even more: 



d{U<5>v, U^w) = d(*, tf) 



(30) 



by 

a i ((U^v)HU^w))=a i (^^f), V Ue u(T),v, w eu(n t ) (31) 

III. Performance analysis: Diversity 

In practical settings, where p, T <C oo, the receiver metrics 
d, d fail to be the sole code design criteria. Denoting the 
pairwise error probability of mistaking one symbol for another 
at the receiver generically as Pij one gets the union upper 
bound 

for the exact error probability. This section deals with the 
pairwise error probability Chernov bound, more precisely 
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with the diversity, which is essentially the reciprocal of the 
Chernov bound. It turns out, that the receiver metric coincides 
with the first order term of the diversity and the highest 
order term leads to the so called diversity product (further 
analyzed in section IV). Adopting the diversity as the major 
performance measure, section III-A investigates the connection 
between non-coherent and coherent channel designs and its 
consequences for code design. For convenience we fix the pair 
($, of code symbols throughout this section and suppress 
their notation as function arguments. 

For the coherent channel case the pairwise error probability 
has been calculated in [2] to 



Res a 



u + i/2 



n 

i=l 
Si>0 



(33) 



With 5 t := y j t {pT /n t )a'l 

coherent channel case holds [2] 



Analogously for the non- 



LJ + l/2 



n 

i=l 
q_i<l 



(p£) 2 (l-af)(o;2 + g?) 



(34) 



with a 



l + (pT/n t ) 

For both cases the we have the Chernov bound 



1 /"* 



whereas (coherent channel) 



Q = Q ■= 



PL 

respectively (non-coherent channel) 

t 

Q = Q 



(P-) 2 



+ 1/4 4(1 + 



(Ti = X I 



(35) 

(36) 
(37) 

(38) 
(39) 



The term in parentheses in (35) is called (pairwise) diversity 



Viv:=]J[l + ga 2 



(40) 



and we take it as our basic performance measure for codes. 
Rewriting Viv as a polynomial in g requires the use of 
elementary symmetric polynomials defined by syiriQ 4 := 1, 
sym^(xi,...,a;„ t ) = E/^-y) Xj. = Ei j£ (»t) x h ■ ■ ■ x {j 

(with := {(h, . . . , ij) e W 1 1 < ii < ■ ■ ■ < ij < th}), 
j = 1, . . . , n t . With the abbreviation 

*,•:= syrriQ (a*,..., <)= ^ s% (41) 



we find generically 

nt 

Viv = ^ Sl Q l (42) 

i=0 

The first and highest order coefficient of this polynomial are 
of particular importance, since they dominate the diversity in 
the low and high SNR regime respectively. They are called 
diversity sum and diversity product respectively, and are given 
by 



V^l = IM: 



P ■ = 



(43) 
(44) 

AIL 



The diversity sum is our familiar metric d = \\a\\ 

(12), resp. d = \\q\\ = \Jtr(l - A t A) = ||sint?|| (29). The 
diversity product acts as a regularity criterion for the positive 
semidefinite matrix A^A, resp. 1 — A^A: In the coherent 
channel case p 2 — det A^ A > is known as diversity criteria 
(resp. rank criteria or determinant criteria) in the literature (e.g. 
[1]). In the non-coherent channel case p = det(l- A f A) = 
(1 - <j\) . . . (1 - a 2 nt ) ( = sin 2 i?i . . . sin 2 i?„ t > measures 
the positivity of the principal angles between ($) and (\&). 

All terms si p resp. Si j in the diversity expansion possess 
the invariance properties induced by (15), resp. (31). Therefore 
the analysis in this section applies to all terms in (42) and the 
result can be stated in closed form for the full diversity, rather 
than only to its first and highest order coefficient. 

Specializing (42) to the non-coherent channel case, one 
checks easily that the coefficients are formally defined on all 
of V® T , but independent of the choice of the representing 
nt -frame. Note that the coherent and non-coherent channel 
diversities are formally similar due to (42), but the constituting 
singular values (14), (28) reflect the underlying topological 
structures induced by the maximum likelihood receivers (11), 
(25) (resp. the metrics d, d). And these structures are entirely 
distinct. 



A. Embedding properties 

Now let us investigate the relation between the non-coherent 
and coherent channel diversity quantities. From the informa- 
tion theoretic inequality I(X; S) < I((X, H); S) between the 
corresponding mutual informations we expect such a relation 
satisfied by the diversity. The ranges for a (14) and q (28) 
indicate, that the coherent channel receiver may benefit from 
some higher 'resolution', but if and how this carries over to 
the diversity is not obvious and requires a rigorous proof. The 
investigations of this section give an affirmative answer to that 
conjecture. 

By a slight abuse of notation let us define the 'fiber minima' 
of Sj. with respect to the fibers of (24) as 



sdistj.($, := 



mm 



(45) 



Then we obtain 



M7) 



Lemma III.l 

Let ($) , (<]/) € G® t t separated by principal angles 
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) 1 ,...,d nt . Then Vj^t) 



Mist 7 .($,*) = H Y[(l~ costf„ t _ l+1 ) (46) 

Due to left invariance of s 7 . we can switch to the canonical nt- 
frame bases * , $o (23) of U C ($). With $ (w)j= 
(m £ U(n t )), running through the fiber over ($o), := 
$o(u)-*o, and |A(u)+A(u) = 1 - H ((cos (recall, that 
(cost?;) = diag(cost?;) (= i...„ t ) we have 

4($o( M ),*o)=n^( A H) 

( ^n2{i-A„ t _ 4+1 r((co S ^H]} 

> V n I 1 ^-<+i ((cos1?i)«) } (47) 

= 2^ [] {l-<r„ t - i+ i((cos0,))} 

ie/ 7 



2^ J](l-cosi?„ t _ i+1 ) 



where (*) comes from the general inequality 

K{ n {A))<o t {A) 



(48) 



devoted to Fan-Hoffman in [31, Prop. III. 5.1]. u = 1 achieves 
equality in (*) and this completes the proof. ■ 
In particular we have the fiber distance 

ddist($, *) := min 

*6^ _1 «*)) 



sdist ?i = 



2^(1-008^) 



(49) 



and its analogon for the diversity product 

pdist($,*):= min 



sdist r 



\ 



(50) 



2™*J|(l-cost? 4 ) 



We observe, that the fiber minima sdist 7 . for each given 
pair ($, are realized by the same choice (^Oj^o). which 
justifies the definition 

sdist.,($,*) := sdist 2 . ( 51 ) 

(thus we have in particular rfdist = Vsdisti and pdist = 
^/sdist„ t ) leading to 



Corollary III.2 

For any pair ($) , (*) £ G® t T we have Vj=i t ..., nf 

«,•(*,*)< sdistr Sj tf ) (52) 

The second inequality holds by definition of sdist. So let us 
turn to the first inequality and denote the principal angles 
between ($) and (*> by i? = . . . , i?„ t )> < d l < f . We 
have = Uiei, sin2 ^ = He/, (! ~ cos2 and sdist/, = 
23 Hieiji 1 ~ costf^-i+i) = n ie/j 2(1 - costf„ t _ m ). For 
any < tf; < f 



holds, thus 



with I* 



2 COS 7?, — cos 2 < 1 



Sj. < sdist j, 



(53) 



(54) 



(nt — i\ + 1, . . . , nt — ij + 1) for each Ij = 



(h, . . . and by 



(54) 



sj. < sdist 7 . = sdist I . = sdist j (55) 
ij ij ij 

the claim follows. ■ 
Given a function s : C x C — > R, let us define s min := 
mincxc s. Then we state 



Corollary III.3 



Vj=i,..., 



(56) 



(unfortunately neither there seems to be a canonical way to 
determine the pairs of points, which realize the minima, nor 
whether this could be achieved simultaneously for each of the 
quantities above by a single pair of points) 

Proof: 

Corollary III. 3 is an easy consequence of Corollary III. 2. For 
each inequality the proof goes the same, so let us take two 
functions f < F, X x X — ► 1,1a discrete set, for brevity. 
Then there are two cases 

1) f min = f(zo,yo) < F(xo,yo) = F min and there is 
nothing to do 

2) f min - f(x ,yo), but F(x ,y ) > F(x llVl ) = F min . 
But then still f min < f(a;i,yi) < F(x uyi ) = F min 
holds. 

■ 

On the metric level (the diversity sum) this inequalities provide 
a distance gain due to the channel knowledge. It increases the 
resolution of the detector and allows the receiver to separate 
points better than the non-coherent channel receiver could do, 
d < d, or equivalently the unit spheres with respect to d 
occupy smaller volume than the corresponding (embedded) 
unit spheres with respect to d, thus one can pack more en- 
spheres into V® T than d-spheres. But due to the famous 
estimate (48) we have proven a considerable stronger result 
not confined to the diversity sum, but rather to any coefficient 
in the diversity expansion (42). Thus we are able to relate 
the inequalities derived so far to the diversity as a whole: 
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Comparing the 'effective' SNRs g (36), g (38) in the diversity 
(42) demands one additional estimate (provided T > n t ) 



T \2 



-0=T 



< 



T 



Q< Q 



(57) 



thus we have 



Proposition III.4 



For any pair ($) , (*) <E G c 



n t ,T 



(58) 



holds. 



Proof: 

The proposition follows directly from (52) and (57) ■ 
So we conclude, that the coherent channel maximum like- 
lihood receiver applied to C G has at least the diversity as 
the non-coherent channel receiver, the diversity grows. This 
approves the information theoretic inequality I(X;S) < 
I((X,H);S) motivating our analysis. 



Having explored the relationship of the embedding GjJ 



n t ,T 



c 



V^ t T let us come to a somewhat complementary scenario, 
which offers the possibility of coding complexity reduction: 
Consider a single fiber over ($). Then, by $w = ($, $-"-) ( <j ), 
there holds a special kind of 'vertical' left invariance, namely 

sj. ($u, $«) = (u, v) , V Ije (jy V u ,veu(n t ) (59) 

where the right hand side is evaluated in U(n t ) = V„ t nf . 
Analogously we define for the special case T — n t : g u := |, 

:= E/ je («>t)(s^.) 2 , TOt^ := E l ^(e C/ ) 1 and we arrive 

at 



Moreover for both parts the code design is easier to solve than 
in (V® T , d): In U(n t ) there are many solutions (i.e. codes) in 
the common literature, e.g. the Alamouti scheme for n t = 2, 
orthogonal designs for n t > 2, quasi-orthogonal space time 
block codes, and many more. The Grassmannian part is also 
simpler (not only concerning dimensions but also) in structure, 
because the 'chordal' design metric d is geometrically more 
natural than the Euclidean distance measure d (in terms of their 
relation to the natural geodesic distance [16]), thus geometric 
methods may apply. Also packings in (G„ t T , d) are already 
in the focus of current research, e.g. [30], [15], whereas 
[30] also contains explicit constructions for packings in the 
(real) Grassmann manifold. In [32] a differential geometric 
connection (based on [16]) has been developed to construct 
space time (and space frequency) codes for for the coherent 
and non-coherent channel case. Further research [17] led to 
space time codes with reduced design complexity by utilizing 
IH.5. 

Remark III.6 

A related question arises, when one considers the task of 
given a code C u , does there exist a code C v with the 
same rate but better performance than C u l Concerning the 
diversity sum d a partial answer gives [16]: The transmit 
power constraint sets the requirement \J~^d mm > d Umm . 
Since there exist a monotonically increasing lower bound 
for d mm when ^ grows (Proposition 1.1) this requirement 
can be certainly fulfilled. This again emphasizes the need 
for coding strategies in the general coding spaces V® T , 
G„ T , T larger than nt- However, it remains an open 
question, whether we can achieve the goal by composed 
codes of the form C v = C G ■ C u . 



Theorem III.5 

Given codes C G C G^ t T C V^ T and C u C U(n t ), then 
the composed code C v C T given by 



c v : =c G -C u = {<S>u\<£eC G ,ueC u } 
satisfies 



>min{sdistf n ,sy min }, Vj 



(60) 



(61) 



and 



(62) 



Viv > mm{Viv min ,Viv Umin } 
holds, whereas Viv 11 := ^2 i (^) l s'i J (g 11 ) 1 (thus the power 
constraint factor \J~^_ sharpens the estimate). 

Proof: The theorem follows directly from Corollary 
III. 3, Proposition III.4 and the definition of sdist as a fiber 
minimum. ■ 
Therefore the code design splits up into two parts: Codes C G 
represent the familiar coding problem for the non-coherent 
channel corresponding to G% t t< which has smaller dimension 
as the general problem in V^ t T . The code C u represents a 
coding problem for the coherent channel in U(n t ) = V® t „ t , 
contributing the dimensions left by V^ t T = G® f t x U(n t ) 
locally. So both parts represent a somewhat smaller coding 
problem with respect to the dimension of the signal spaces. 



Remark III.7 

A conceptual simple (but computational complex) embed- 
ding of G® t T into V^ t T is given by the parametrization 
of G® T with (so-called 'horizontal') tangents X H = 

(° _ o 1 )' B £ & T ~ nt)xnt in its total s P ace U{T). 
In a recent article [21] it has been shown, that coding 
for the non-coherent channel is under certain assumptions 
equivalent to coding on the horizontal tangent space, with 
respect to the coherent channel diversity for T-n t - 
Combining that with Theorem III. 5 we can roughly state 
this correspondence as V^ t T _ nf C G® f T C V® f T , which 
gives rise to a sequence . . . — > Cj — > C G +1 — > CY +1 . 
of codes with increasing block length i- nt, i = 1,2,... 



IV. Extremal properties of the diversity 

In this section we examine the distribution of pairwise an- 
gles in C G to find criteria for maximum diversity in particular 
for the combined code C v =-C G ■ C u in ,/^Vl C t- We focus 
on the diversity sum and diversity product, representing the 
most important diversity quantities (since they dominate the 
small and high SNR regime of diversity) while still being 
simple functions of the principal angles. 

To get some first insight into the interplay between di- 
versity sum and product (with respect to a fixed pair $, * 
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of code symbols) we exploit the homogeneity of the ele- 
mentary symmetric polynomials. For both coherent and non- 
coherent channel case it is quite natural to write a := tAt, 
Si ■= j^pr = sym^of, . . . ,&l t ), The importance of this 
factorization arises from the identity d = \\a\\, thus we can 
now write 



Viv — ^ Si (d 2 gY 



(42') 



which emphasizes the intuitively obvious fact, that scaling 
of g (resp. p) behaves reciprocal to scaling of the distances. 
Moreover, we see that the diversity scales (term wise) with 
(an appropriate power of) the metric d, which means in 
particular that the task of maximizing the diversity behaves 
in its higher order terms (especially the diversity product) like 
a constraint on the packing problem determined by the diver- 
sity sum, contrasting the impression one might have gotten 
by considering only the Chernov bound (35), which seems 
dominated by its highest order term. Consequently we have to 
control p constrained on the unit sphere S4. In summary the 
homogeneity property (42') scales all orders of diversity by 
the pairwise metric distances, turning the diversity orders > 2 
into local quantities. Thus maximizing diversity corresponds 
roughly to locally maximizing the diversity product while 
globally maximizing the diversity sum (constrained packing 
problem). The behavior of the diversity product on large scales 
becomes unimportant due to the contributions of the lower 
order terms. Let us therefore perform a Lagrangian analysis 
for the diversity product constrained on the unit sphere. 

Lagrangian analysis: The non-coherent channel diversity 
sum and product and the corresponding lower bounds for their 
coherent channel analogues (by embedding), are functions of 
type H($) = YTiU or K(d) = f^i M^) with either 

h = sin 2 (for d 2 (29), p 2 (44)) or h = 2(1 - cos) (for ddist 2 
(49), pdist 2 (50)). Their domain of definition is the closed 
simplex 9 of principal angles (see figure 1) 

© := {#= (0i,...,0„ t ) I O<0i < ••• <ti nt < |} (63) 

(the open simplex being 8 := {■& e R rit 1 < #1 < 
• • • < i?„, < 7r/2}) but since the principal angles vary 




Fig. 1. for n t = 3 

like the identity map id for < < n/2 but extend to 
7r — id for tt/2 < #j < ir (considered as a function on the 
aperture angle), the function h (i.e. 1 — cos) fails in general 
to be differentiable transversal to the closed facet H n / 2 of 
containing $ nf = tt/2. Transversal to any other edge h is 



smooth, of course. In order to apply the classical Lagrangian 
formalism of constrained optimization problems in R™* to the 
present situation, we must convince ourselves, that the non- 
smooth edges and the 7r/2-facet of do not interfere. Our 
next task therefore consists of an appropriate decomposition 
of into smooth pieces, which decompose the optimization 
into a series of smaller tasks of one single type, solvable 
simultaneously in R', I < m (formula (67) shows the resulting 
problem formulation). 

We need a little bit more notation. Let 0< := \ H^/ 2 , 
and 90 < := 0< \ the C° boundary manifold of with 
the problematic facet removed. For I — 0, . . . , k the faces 
contained in 0< of dimension I are given by 

9«0< := 



f = #1 = 

< tfpi+i = 



< & 



pi 
= 0* 



Pi 



Pi < ■■■ <Pi 
i — 1 . . .n t — (I — i + 1) 



thus c?W© < consists exactly of those faces in 0<, which are 
given by pi (possibly = 0) zero angles followed by I 'blocks' 
each of equal nonzero angles, in increasing order, in particular 
<9(°)0< = {0}, a( n *)0< = 0. Each face dW©< is a smooth 
submanifold of <90<, with <9(dW0<) = d^ 1 ^ and 0< = 
U™i 1 9^0<. The tangent spaces are 



A, e R 
pi < ■ ■ ■ < pi < pi+i = n t 
Pi = i - 1 . . .nt - (I - i + 1\ 



Lemma IV.l 

Given (0) = ZZi = nr=i K*i) ° n e < 

with h £ C°°([0, 7r/2[) (this means, that h is differentiable 
from the right in 0) and h'(0) — 0. Then for I = 0, . . . , n t 




(>©<) 

g radx |a(i)e< er(a«e<) 



(64) 
(65) 



thus restricting the gradients remains intrinsic. 
Proof: 

We have gradiJ(tf) = . . . , ft'(i?„ t )). Since h'(0) = 

the symmetry of H ensures 

grad J ff| 9(1 )e < (??) = ELi ^'(^K+i)( e P,+i + ■■■ + e P ,+i) G 
T(aW@<), with Aj = h'{d Pi+1 ). Similarly, gradif(^) = 
(h'Wi) Uj^i h(0j))i, thus gradX| 9(1 ) e< (^) = (for Pl > 

0) or grad J fiT| 9(1 )e < (^) = E'=i A *( e P,+i + ■■■ + e P ,+i) G 
T (aW©<) with 

A, = h'(tf B+1 )/ l (tf B+1 )m.-«-in. ?s .^ 1 , J+ i) w+1 -». ■ 
The lemma ensures, that the Lagrangian functional F — 
f — X(g — S) on a neighborhood of for critical points of 
/ obeying the constraint g = 5 (f,g either H or K, 6 € R) 
applies to the boundary 90 <. Since / and g are not necessarily 
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differentiable transversal to H v / 2 , extremal points in lie in 
the set {g = 6} n H w/2 U 0< n {DF = 0} (recall, that n 
has higher precedence than U and D denotes differentiation). 
For a more unified treatment we define ©W := {0 < tii < 
• • • < $i < tt/2} (in particular ©W = and ©("«) = 0) and 
recall, that on the one hand /, g are differentiable tangential to 
Hn/2 and on the other hand i_1 U {$; = ••• = i?„ t = tt/2}, 
I = 1, . . . , n t — 1 exhausts H w / 2 . This leads to the following 
recursion scheme: For I = 0, . . . , n t set 



ir( , )(0i,...,0,) = 5>(0i) + M5)(»t-O 



i=l 



(66) 



^)(^ 1 ,...,i?o=nw- /i (f) 



(n t -0 



and fV\g^ given by either or ifW. Then the extremal 
points in lie in the set 



U (i DFil) 



0} n e<? u 



ti T . 



1=0 



';}) (67) 



whereas := — X(g^ — 5). Furthermore, for h mono- 
tonically increasing on [0,7r/2] and zero at 0, the conditions 
5 forces S e [0,/i(f)n t ] (resp. 6 e [0,/i(f)™]) and 



<VMf )1 = : I- thus 1 restricts to 



j(0 

in the first case Z > |~n t 
{Z, . . . ,n t }. 

Let us now start the Lagrangian analysis of the diversity 
(resp. with the analysis of the various diversity sums and 
products). The non-coherent channel diversity sum/product 
(d)(p) as well as the lower bounds (rfdist)(pdist) for the 
coherent channel analogues depend on the U (n t ) -fibers only 
(moreover they depend only on the principal angles). By left 
invariance we can always assume \I> = ( J ) and consider the 
diversity terms as functions (marked with an g) on the single 
argument $, for which ($) is separated by principal angles 
tie from {(D). 

P 2 , : 

-Q|s s a(«) 

In order to find the maximum of p in the unit distance 
sphere we constrain / = p 2 = Y\ t sin 2 on S d i (6), by setting 
.9 = 4x> — X^sin 2 . Thus we get the Lagrangian functional 
F(t?) = lit sin2 ^ - X(J2t sin 2 ti t - 6), < S < n t . Here we 
have Z — [~n t — <f| and from (67) we get for each I £ {/,..., nt} 



{DF {1) =o}ne 

■ = #! = ...= 



n 



pi 



<ti 



pi+i 



< 



J^sin 2 y pi< j<i 



From this we get extremal points with / ^ only for pi = 0, 
i?i = • • • = tii = 0W with Zsin 2 0« = (5 - (n t - Z) and 
= (sin 2 (() )' = (1 - ^f^-) 1 , monotonically increasing 
with Z, therefore 



maxp 2 , = f (,lt) 
attained in tii = . . . ti n = 9, sin 2 9 - 



S 
n t 

S/n t . 



(68) 



Conclusion IV.2 

Locally the code points ofC G for the non-coherent channel 
have to be distributed with as many of their pairwise 
principal angles to be nonzero and equal in modulus as 
possible. 

In principle, the same holds for the coherent channel, if we 
consider p and d as functions of a: The maximum diversity 
product is attained for a\ = ■■■ = a nt = s, s 2 = 5/n t , 
-2 _ s 2n t _ ^S/n t ) nt , but it seems difficult to characterize all 
$ € ^ (6) subject to d\ = . . . a nt = s. Let us embed C G into 
V® rp instead and investigate the question, which conditions 
have to be imposed on C G in order to achieve some diversity 
gain in terms of ddist and pdist. 

<2dist 2 i : 

For the metric fiber distance cZdist constrained on S d 2 (S) the 
Lagrangian functional reads F(ti) = cZdist 2 ,^) - A(d 2 (i9) - 
S) = 2k — 2 J^i costii — A(5^i sin 2 ^ — 5). Again we have 
< 5 < n t , Z = \n t — 5] and from (67) we now get for each 
Z e {{,... ,n t } 



{DF® = 0} n 0^ } = {.g 

r o = tii = ■ ■ 



(0 



«} 



n 



<ti 



pi+i 



< • • • < tii < 



x = 



cosfij 



? Pl <j<i 



Extremal points are contained in = tii = ■■■ = ti Pl , 
ti Pl+1 = ■■■ = tii = 6»W subject to (Z - Pl )sm 2 9^ = 
S — (n t — Z) for I = I = \n t — S~\ , . . . , nt and pi = 0, . . . , pi = 
[n t - 5\, with f$ = 2(1 - pi)(l - cosflW) + 2(n t - Z) = 
2(Z-p 1 )(l-Vl — 9) + 2(n t -Z), g := = sin 2^). For 

fixed Z the function pi i — > /p^ is monotonically increasing 
(by analyzing the derivative, where defined) and we find 
/« := min pl /« = and /« := max pi /« = /CO. 
As functions of Z both terms turn out to be monotonically 
decreasing (by analyzing the derivatives with respect to Z) and 
we find 



min rfdist 



0\S d 2 (S) 



f 



(nt) 



2n, 



(l - y/l - (5/nt) 



(69a) 



attained in tii = . . . ti nt — 9, sin 2 9 = S/n t , and 



max ddist 



Q \s d 2(S) 



7 (D = 2{l-Vl-(S-[S\))+2[6\ 

(69b) 



attained for tii — . . . ti 



...tir, 



Ln t -«J 

tt/2. 



0, sin 2 ^ r „ t _ 51 =5-[5\, 



V\n t -6-]+l 

pdist 2 i : 

0|s^ 2 (5) 

Examining pdist instead of cZdist we have the Lagrangian 

F(ti) = 2 k 11,(1 - cos^,) - AQTV sin 2 tii - 5) and 

{DF^ = 0} n 0^ - {.gW = (5} n 
= t?i = ••• =ti 



J pi 

< ti Pl+ i < ■ ■ ■ < tii < | 



A = JJ(l-COS<?j) 



COS ti. 



i+3 



V P l<j<l, 
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From this we get extremal points with / ^ only for p\ = 0, 

i?i = • • • = i?j = 0W with Zsin 2 6>C ) = ( 5 - (n t - I) and 

/W = 2 n «(l - cosO^Y = 2 n *(l - yf^f^) 1 , monotonically 
increasing with Z and therfore 

maxpdistV =/("') =2"' (l - ^1 - (70) 
in tfi = ...i? nt = 9, sin 2 9 = <tyn t . 
Remark IV.3 

If we had constrained pdist on 5g dist 2 ((5), < 5 < 2n t 
we would have gotten 

6 \n t 



maxpdist 



os-. 



n t 



(71) 



attained in i?i = . . . § nt = 9, 2(1 — cos9) = S/n t , by 
analogy to the case of p constrained on d. The apparent 
discrepancy of (70) to (71) is caused by (69b), since 
at i?i = ••• 



■Or 



- 9 rfdist attains its minimum 
5 = 2n t (l — \J\ — 8/nt) on 5^(5) which yields (with 
(71))pdist 2 |3distM = (S/n t r = 2"'(1-VT^J7^)"', 
which coincides with (70). 

Conclusion IV.4 

What remains is the general rule, that for g = 1 
(compare (36)) in favor of pdist instead of ddist one 
should distribute the pairwise principal angles in C G 
locally to be all nonzero and equal in modulus (by (70)). 
This coincides with the preferred strategy (68) for the 
non-coherent channel code. For q < 1, when the higher 
order diversity terms become less important, it might be 
better to distribute the pairwise principal angles globally 
to maximize rfdist, thus separating points in C G by as many 
of the principal angles to be zero such that the remaining 
ones attain large values. 

pdist 2 i : 

Finally, to get an product analogue of (69) we analyze pdist 2 
constrained on S p 2 (S), < 5 < 1, thus the Lagrangian reads 
F(0) = 2 fc n 4 ( 1_ - cos ^) " A(ILsin 2 i? 4 - 8) and for I e 
{0, . . . , nt} we have 



7 (0 



{DF® = 0} n = {.9 



= <?i = • • • = P1 

< tfp 1+ l < ■ ■ ■ < t?j < 



5} n 

a = 



2™*- 1 J-r 1 - COS??, 

A A S in 2 ^ 



cos 



V Pl <j<;_ 

<5 > forces pi = and we get $1 = • • • = ??j = 0^, 
si n 2 '6>W = (5, thus /W = 2"'(1 - cos0W) ! = 2"'(1 - 
— which is monotonically decreasing in i, so 



minpdist 



OS, (5) 



/("*) = 2"' (l - v 7 ! -<5 1 /«*y it (72a) 



attained in 



maxpdist 



...■& n , =9, sin 2 9 = S 1 /^, and 



OS, (5) 



(72b) 



attained in sin 2 i?i = 5, $2 = ■ ■ ■ = i?n t = 7r /2 (whereas /(°) 
is contained in when <5 = 1). 

Observing, that the minima in (69) and (72) are monotoni- 
cally increasing in S we end up with 



Proposition IV.5 

In the situation of Proposition III. 4 we have 



ddisV 



> min rfdist 



= 2n t (l - sf\ -d 2min /nt) > d 



(73) 



2 min 



pdist 2mm > minpdist 



Q S 2 (p 2 



(74) 



2 "t ^1 _ yj[ _ p 2min 1 /«t^ > p 2 min 



The benefit of this proposition is, that it relates 5dist min 
(resp. pdist min ) directly to d min (resp. p min ), regardless if the 
minimum distances (resp. diversity products) are realized by 
the same pair of points or not. 

Proof: 

Its only left up to show the second inequality in each formula, 
which is elementary (setting x := d 2mm or x := p 2mm 7 
respectively): 

(73) y/l - x/n t < 1 - x/2n t 1 - x/n t < 

(1 - x/2n t ) 2 = 1 - x/th + x 2 /An 2 . 



(74) 



Vl - a; 1 /"* < 1 



a/fc 



(1 - x 1 / k /2) 2 = 1 - x 1 /^ + x 2 / n * /4 



/2 



1 - a; 1 /™* < 



V. Conclusions 

This work should be seen as a second step towards a 
geometry based analysis of general space time block codes, 
inspired by the results in [16], opening the door to potentially 
high performing space time block codes, when T ^> nt. 
The various estimates and interrelations explored in this work 
assemble the following overall picture: 

• Diversity monotony: The performance analysis revealed 
nice embedding properties (with respect to G® t T C 
Vjh t) °f tne diversity quantities (Corollaries III.2, LU.3, 
(57)), leading to a diversity growth (Proposition III.4) 
in the transition from the non-coherent channel to the 
coherent channel. This turned out to be due to the various 
invariance properties satisfied by the diversity, though tied 
to distinct underlying topologies of the coding spaces 
induced by the maximum likelihood receiver metrics. 
Moreover, for the diversity sum and product, more ex- 
plicit estimates have been derived (Proposition IV.5). 

• Complexity reduction: Embeddings of both G® tT an d 
U (nt) into V^ t x can be used to construct codes on V® T 
from 'smaller' pieces (Theorem III. 5), both of them being 
already in the focus of current research. The other way 
round, given an non-coherent channel space time code 
and a 'small' coherent channel code, the performance of 
the resulting (larger dimensional) product code on V® T 
is lower bounded by the diversity expressions stated in the 
theorem. Thus the design complexity has been reduced 
to the smaller problems on G® T and U(n t ). Together 
with Proposition 1.1 this opens the door to potentially 
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high performing space time block codes, when T 3> n t . 
As already indicated in the introduction this may be of 
some importance in the context of space frequency codes 
also. 

• Localization: The local nature of the higher order diver- 
sity quantities turns space time coding into a constrained 
packing problem. The diversity sum still represents a 
major criteria, locally superposed by the diversity product 
as a rigidity constraint: The optimal code in the high SNR 
regime is packed as 'diagonal' as possible, uniformly 
maximizing the principal angles (Conclusion IV.2 and 
IV.4). 

There are still many open issues. Some immediate will be 
listed next. The explicit bounds of Proposition IV.5 are very 
coarse and improvements are necessary. Moreover, it would be 
desirable to obtain further decompositions in Theorem III.5. 
Furthermore this work has to be related to the differential 
coding scheme [9], which benefits from high rates compared 
to non differential codes. Finally it remains the challenge 
of effective high dimensional code construction (especially 
for the non-coherent channel) with low complexity decoding 
properties. 
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